Intercept Guidance of Maneuvering Targets with Deep Reinforcement Learning

نویسندگان

چکیده

In this paper, a novel guidance law based on reinforcement learning (RL) algorithm is presented to deal with the maneuvering target interception problem using deep deterministic policy gradient descent neural network. We take missile’s line-of-sight (LOS) rate as observation of RL and propose reward function, which constructed miss distance LOS train network off-line. process, trained has capacity mapping normal acceleration missile directly, so generate commands in real time. Under actor-critic (AC) framework, we adopt twin-delayed (TD3) by taking minimum value between pair critics reduce overestimation. Simulation results show that proposed TD3-based outperforms current state law, better performance cope continuous action space, also faster convergence speed higher reward. Furthermore, accuracy robustness when intercepting target, converged.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Missile Guidance Law Based on Sontag’s Formula to Intercept Maneuvering Targets

In this paper, we propose a nonlinear guidance law for missiles against maneuvering targets. First, we derive the equations of motion described in the line-of-sight reference frame and then we define the equilibrium subspace of the nonlinear system to guarantee target interception within a finite time. Using Sontag’s formula, we derive a nonlinear guidance law that always delivers the state to ...

متن کامل

Integrated fuzzy guidance law for high maneuvering targets based on proportional navigation guidance

An integrated fuzzy guidance (IFG) law for a surface to air homing missile is introduced. The introduced approach is a modification of the well-known proportional navigation guidance (PNG) law. The IFG law enables the missile to approach a high maneuvering target while trying to minimize control effort as well as miss-distance in a two-stage flight. In the first stage, while the missile is far ...

متن کامل

A Comprehensive Approach to Develop a Continuous Fuzzy Guidance Law for Maneuvering Targets

Based on the idea of Continuous Fuzzy Guidance Law (CFGL), a andldquo;three-phase fuzzy guidanceandrdquo; (TFG) law is proposed for the class of surface to air homing missiles. The current approach enables the guidance law to track a maneuvering target from the beginning of the launch phase up to the terminal one while itdynamically attempts to keep miss-distance, flight time and control effort...

متن کامل

Operation Scheduling of MGs Based on Deep Reinforcement Learning Algorithm

: In this paper, the operation scheduling of Microgrids (MGs), including Distributed Energy Resources (DERs) and Energy Storage Systems (ESSs), is proposed using a Deep Reinforcement Learning (DRL) based approach. Due to the dynamic characteristic of the problem, it firstly is formulated as a Markov Decision Process (MDP). Next, Deep Deterministic Policy Gradient (DDPG) algorithm is presented t...

متن کامل

Deep Reinforcement Learning with POMDPs

Recent work has shown that Deep Q-Networks (DQNs) are capable of learning human-level control policies on a variety of different Atari 2600 games [1]. Other work has looked at treating the Atari problem as a partially observable Markov decision process (POMDP) by adding imperfect state information through image flickering [2]. However, these approaches leverage a convolutional network structure...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: International Journal of Aerospace Engineering

سال: 2023

ISSN: ['1687-5966', '1687-5974']

DOI: https://doi.org/10.1155/2023/7924190